NSM: A Distributed Storage Architecture for Data-Intensive Applications
نویسندگان
چکیده
Several solutions have been developed to provide dataintensive applications with the highest possible data rates. Such solutions tried to utilize the available network resources through parallel I/O and TCP/IP tuning in order to achieve a better data throughput. The focus was on achieving the highest possible data rate while other performance enhancements factors were ignored. Furthermore, most of those solutions were point solutions and designed to work in a specific environment for a particular application. In this paper, we introduce the Network Storage Manager (NSM). NSM is a java-based, high-performance, distributed storage system with auto reconfigurability that has been developed in the Distributed Computing Laboratory at Jackson State University. The system is designed as a framework for data-intensive distributed applications of different natures. In addition to an architecture that employs parallelism, scalability, crash recovery, and portability, NSM provides applications with full control to optimize other application-controllable features by allowing applications to fine-tune such features or even plug-in their modules and use it instead of the standard NSM implementation.
منابع مشابه
Network Storage Management in Data Grid Environment
This paper presents the Network Storage Manager (NSM) developed in the Distributed Computing Laboratory at Jackson State University. NSM is designed as a Java-based, high-performance, distributed storage system, which can be utilized in the Grid environment. NSM architecture presents a framework offering parallelism, scalability, crash recovery, and portability for data-intensive distributed ap...
متن کاملDesign and Evaluation of Distributed Smart Disk Architecture for I/O-Intensive Workloads
Smart disks, a type of processor-embedded active I/O devices, with their on-disk memory and network interface controller, can be viewed as processing elements with attached storage. The growing size and access patterns of today’s large I/O-intensive applications require architectures whose processing power scales with the storage capacity. We evaluate a distributed smart disk architecture with ...
متن کاملDistributed and Big Data Storage Management in Grid Computing
Big data storage management is one of the most challenging issues for Grid computing environments, since large amount of data intensive applications frequently involve a high degree of data access locality. Grid applications typically deal with large amounts of data. In traditional approaches high-performance computing consists dedicated servers that are used to data storage and data replicatio...
متن کاملData Replication-Based Scheduling in Cloud Computing Environment
Abstract— High-performance computing and vast storage are two key factors required for executing data-intensive applications. In comparison with traditional distributed systems like data grid, cloud computing provides these factors in a more affordable, scalable and elastic platform. Furthermore, accessing data files is critical for performing such applications. Sometimes accessing data becomes...
متن کاملAgent-Based Query Optimisation in a Grid Environment
+ IASTED International Conference on Applied Informatics, Innsbruck, Austria, February 2001 Abstract The next generation experiments in High Energy Physics are the driving force for setting up an International Data Grid at CERN, the European Organization for Nuclear Research. Hundreds of Petabytes of data will be distributed and replicated all over the globe starting from 2005. In order to anal...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003